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QUANTILE  STATISTICAL  DATA  ANALYSIS 


Emanuel  Parsen1 
Department  of  Statistics 
Texas  AleM  University 


Abstract. 

This  paper  presents  some  reasons  why  theoretical  and  sample  quantile  functions  should  be  routinely  used 
by  contemporary  statistical  data  analysts.  Quantile  methods  are  introduced  in  the  context  of  the  exponential 
distribution  as  a  fit  to  the  historically  important  life  table  data  of  Graunt  (1661).  Section  titles  are:  history 
of  statistics  and  contemporary  textbooks;  quantile  concepts;  identification  quantile  function;  identification 
quantile  box  plot;  tail  classification  of  probability  laws;  goodness  of  fit  plots;  IQQ  plot;  cumulative  weighted 
spacings  function  D(u);  quantile  simulation  and  distribution  of  extreme  values;  comparison  quantile  function; 
nonparametric  estimation  of  probability  density;  conclusion. 

1.  History  of  Statistics  and  Contemporary  Textbooks. 

A  central  problem  of  statistical  data  analysis  (that  was  formulated  by  19th  century  pioneers  such  as 
Quetelet  (1796-1874)  and  Galton  (1822-1911)]  is  identifying  distributions  that  fit  the  data.  In  The  History 
of  Statistics,  Stigler  (1986)  writes  (p.  268)  that  these  pioneers  emphasised  the  use  of  normal  curves  to  fit 
data;  they  ‘proposed  that  the  conformity  of  the  data  to  this  characteristic  [normal]  curve  was  to  be  a  sort 
of  test  of  the  appropriateness  of  classifying  the  data  together  in  one  group;  or  rather  the  nonappearance  of 
this  curve  was  indicative  that  the  data  should  not  be  treated  together.’ 

By  1875  Galton  ‘had  devised  a  different  way  of  displaying  the  data.  He  ordered  the  data  in  increasing 
order  and,  effectively,  graphed  the  data  values  versus  the  ranks.’  Galton  used  the  name  ‘ogive’  for  the 
theoretical  form  of  this  curve  for  a  normal  distribution;  Stigler  writes  ‘we  now  call  it  the  inverse  normal 
cumulative  distribution  function’.  I  call  this  ideal  graph  a  quantile  function  of  the  normal  distribution;  the 
graph  of  ordered  data  values,  denoted  X(j\n),  versus  (j  —  .5 )/n  or  j/(n  +  1),  is  called  the  sample  quantile 
function,  denoted  Q'(u),0  <  u  <  1. 

This  paper  presents  some  reasons  why  theoretical  and  sample  quantile  functions  should  be  routinely 
used  by  contemporary  statistical  data  analysts.  They  can  be  used  to  not  only  test  the  fit  (or  lack  of  fit)  of  a 
normal  distribution  to  data,  but  also  to  describe  other  general  families  of  distributions  and  to  identify  which 
distributions  fit  the  data. 

Textbooks  with  titles  such  as  Introduction  to  Contemporary  Statistical  Methods  omit  many  important 
topics  that  are  actually  useful  in  the  theory  and  practice  of  statistical  data  analysis.  On  my  list  of  important 
topics  (for  which  I  always  look  in  the  index  and  usually  fail  to  find)  are:  uniform  distribution,  exponential 
distribution,  order  statistics,  extreme  values,  quantile  function.  lYaditional  introductory  textbooks  describe 
methods  based  on  mean  and  variance.  To  qualify  as  ‘contemporary’  a  textbook  adds  the  following  topics: 
box  plot,  fences,  stem  and  leaf  plot,  trimmed  and  Winsorised  sample.  In  my  opinion  quantile  function 
interpretations  are  needed  for  these  topics  to  acquire  beauty  and  utility  that  will  excite  students;  however 
how  to  do  this  is  not  explicitly  discussed  in  this  paper. 

We  introduce  the  ideas  of  quantile-based  statistical  data  modeling  in  the  context  of  the  exponential 
distribution.  Let  X  be  a  continuous  random  variable  with  distribution  function  F(x)  =  PrjX  <  x]  and 
probability  density  function  /(x)  =  F*(x). 
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We  call  F[x)  an  exponential  distribution  with  parameter  A  if 

1  —  F(x)  =  exp(-Ax),  x  >  0,  f(x)  =  A  exp(-Ax),  x  >  0 
Its  mean  ft  equals  1/A,  since  (for  a  non* negative  random  variable) 

fOO  ago  f  oo 

ft  =  I  xf(x)dx  =  /  (1  -  F(x))dx  =  /  exp(— \x)dx. 

Jo  Jo  Jo 

The  standard  exponential  distribution  is  the  exponential  distribution  with  mean  1. 

2.  Quantile  Concepts. 

The  QUANTILE  FUNCTION  Q(u),0  <  u  <  1,  is  the  inverse  x  =  F~1(u)  of  the  distribution  function 
u  =  F(x).  To  find  x  =  Q(u)  one  solves  u  =  F(x). 

For  an  exponential  distribution,  one  obtains  x  =  Q(u)  by  solving  1  -  u  =  exp(-Ax);  therefore 

$(«)  =  (1/A)  log(l  -  u)-1  =  ft{-  log(l  -  u)) 

The  mean  ft  of  a  distribution  F  or  random  variable  X  can  be  computed  from  the  quantile  function  Q: 


The  MEDIAN  and  QUARTILES  of  a  distribution  F  or  random  variable  X  are  defined  to  be 

Q(.5),Q(.25),Q(.75), 

the  values  of  Q(u)  at  u  =  .5,  .25,  .75.  We  define  QUARTILE  DEVIATION  DQ  by  DQ  =  2(Q(.75)  -  Q(  25)). 

For  an  exponential  distribution,  Q(. 5)  =  u  log  2  =  .69 ft;  Q(.25)  =  plog(4/3)  =  .29 fx\  Q(.75)  =  plog4  = 
1.39 ft.  The  interquartile  range  Q(. 75)  -  Q(.25)  =  1.1/s;  quartile  deviation  DQ  —  2(Q(.75)  -  Q(.25))  =  1.2ft. 

Two  important  quantile  concepts  are  g(u)  =  Q'( u),  QUANTILE  DENSITY  FUNCTION,  and  /Q(u)  = 
/(<?(«*)),  DENSITY  QUANTILE  FUNCTION.  For  F  continuous,  F(Q(u))  =  u  and  fQ(u)q(u)  =  1.  For  a 
standard  exponential  distribution,  fQ( u)  =  1  -  u. 

Two  important  universal  measures  of  scale  of  a  distribution  are  DQ  and  1/ f  (median)  =  l/fQ(.S)  = 
g(.5).  They  approximately  equal  each  other  because  DQ  is  a  numerical  derivative  of  Q(u)  at  u  =  .5. 

How  do  we  apply  these  concepts  to  determine  distributions  that  fit  data?  Given  data  (sample)  compute 
a  sample  quantile  function  denoted  Q'{ti).  The  sample  distribution  function  is  defined  by  F~(x)  ^fraction 
of  sample<  x;  the  sample  quantile  function  Q*(u)  is  the  inverse  of  F*(u).  In  terms  of  the  order  statistics 
Jf(l;  n)  <  . . .  <  Af(n;  n)  of  a  sample 

Q'(u)  =  X{j;  n)for(;  -  l)/n  <  u  <  j/n. 

One  usually  adopts  a  continuous  version  of  the  sample  quantile  function  defined  by  linear  interpolation 
between  its  values 

Q'(0  ~  -5)/n)  =  X(j;  n),j  =  1, . . . ,  n. 

When  true  mean  ft  =  18,  and  the  distribution  is  exponential,  Q(.5)  =  12.4, Q(. 25)  =  5.2,Q(.75)  =  25. 
If  similar  values  hold  for  the  sample  analogues  of  population  parameters  (denoted  by  adding  a  tilde  (*)  to 
the  population  notation)  one  suspects,  and  conjectures,  that  an  exponential  distribution  fits. 


Table  1.  GRAUNT’S  Life  Table  (1661).  OBSERVED  PROPORTION  AND  CUMULATIVE  PRO¬ 
PORTION  IN  VARIOUS  INTERVALS  OF  OBSERVED  VALUES  OF  AGE  AT  TIME  OF  DEATH  (IN  LONDON 
1534). 


Index 

j 

Age  Interval 

<r(«o-i)) -<?-(«(/)) 

Proportion 

pU) 

Cumulative  proportion 
«(j) 

1 

0-6 

.36 

.36=  F~(e) 

2 

6-16 

.24 

.60  =  JT(16) 

3 

16-26 

.15 

.75  =  r(  26) 

4 

26-36 

.09 

.84  =  F*(S6) 

5 

36-46 

.06 

.90  =  F"(46) 

6 

46-56 

.04 

.94  =  F*(S6) 

7 

56-66 

.03 

.97  =  F"(66) 

8 

66-76 

.02 

.99  =  r{  76) 

9 

76-86 

.01 

1.00  =  f(86) 

Table  2.  GRAUNT’5  LIFE  TABLE  SAMPLE  QUANTILE  FUNCTION. 


j  01234  5  678  9=k 

u(j)  0.  .36  .60  .75  .84  .90  .94  .97  .99  1.00 

Q-(u(i))  0  6  16  26  36  46  56  66  76  86 


For  an  illustrative  example  we  consider  Graunt’s  Life  Table  data  (that  should  be  familiar  to  all  students 
of  statistics).  It  was  published  in  1661  by  John  Graunt,  in  an  attempt  to  analyse  data  dealing  with  age 
at  time  of  death  in  London.  The  original  data  was  collected  by  Thomas  Cromwell  in  1534  from  Church  of 
England  records  of  births  and  deaths.  Graunt  is  credited  with  starting  modern  statistics  by  creating  Table  1. 
Brilliant  lectures  by  James  R.  Thompson  of  Rice  University  brought  this  important  data  set  to  my  attention. 

from  Graunt’s  life  table  (Table  1)  one  computes  sample  mean  /T  =  18.22  (in  words,  the  average  age  at 
death  was  approximately  18  yean),  Q'(.25)  =  4.2,Qr(.b)  =  11.8  (median  age  at  death  was  approximately  12 
yean),  Qr(.7 5)  =  26,  DQ  -  43.6.  These  are  found  by  interpolating  the  values  of  the  sample  quantile  function 
in  Table  2. 

To  compute  sample  mean  (from  grouped  data)  we  use  formulas 

#»*  *  51  •WMJ  "  '))  +  <n«0)))(“(/)  ~  «0  ~  1)) 

•  Ewwy))  -  <r(«*o  -  i))u  -  -5(«o  - 1) + «o))) 

The  second  formula  can  be  interpreted  using  the  fact  that  1  -  u  is  the  standard  exponential  density  quantile. 

It  does  not  seem  to  be  customary  in  the  literature  to  discuss  which  distributions  fit  the  data  that  one  is 
analysing  (here  Graunt’s  life  table).  Techniques  are  discussed  in  this  paper  which  can  guide  the  statistical 
data  analyst  to  identify  and  test  standard  parametric  distributions  (such  as  the  exponential  distribution)  as 
a  smooth  distribution  that  fits  the  sample.  We  discuss  the  respective  roles:  (i)  F~(x),  sample  distribution 
function,  (ii)  <?*(«),  sample  quantile  function,  (iii)  F*(z),  smooth  distribution  estimated  from  data  (for 
Graunt  life  table,  an  exponential  distribution  with  mean  18.22),  (iv)  (?*(«),  smooth  quantile  function,  (v) 
£T(«)  -  /•!<?»),  comparison  quantile  function,  (vi)  D~( u),  cumulative  weighted  spacings,  tests  constancy 
of  ratio  of  derivatives  Q''(u)/Q'*(u),  (vii)  Q/(u),  identification  quantile  function.  The  statistician’s  problem 


is  to  develop  a  framework  which  explains  how  and  why  to  nse  these  functions  to  develop  graphical  and 
numerical  diagnostics  which  guide  us  to  identify  distributions  (such  as  the  normal  or  exponential)  that  fit 
the  data. 

3.  Identification  Quantile  Function. 

The  median,  which  we  henceforth  denote  MQ  =  Q(.5),  is  a  universal  measure  of  location.  It  is  superior 
to  the  mean  by  the  criterion  of  being  more  robust  (resistant  to  outliers  in  the  data  whose  presence  will  in 
fact  be  detected  by  the  identification  quantile  function).  But  we  recommend  the  median  not  because  of  its 
robustness  but  because  it  forms  one  of  the  tools  of  quantile  based  methods  of  statistical  data  analysis. 

Statisticians  who  favor  (or  at  least  teach)  mean  and  standard  deviation  as  measures  of  location  and 
scale  use  them  to  standardise  the  data  by  subtracting  the  mean  and  dividing  by  the  standard  deviation. 
The  quantile  based  analogy  to  standardisation  is  to  transform  the  random  variable  X  to 

XI  =  (X  -  MQ)/DQ 

whose  quantile  function  is 

QI(u)  =  (Q(u)  -  MQ)/DQ 

We  call  Q/(u)  the  Identification  Quantile  Function.  Our  motivation  for  introducing  this  function  is  that 
it  is  approximately  equal  to  the  unitised  quantile  function 

Ql(u)  «  (Q(u)  -  MQ)/Q'(.S)  =  fQ(. 5)(Q(u)  -  MQ). 

which  has  value  0  and  slope  1  at  u  =  .5.  The  probability  density  /(x)  corresponding  to  the  unitised 
quantile  function  has  been  normalised  so  that  /(median)^  1.  The  unitised  normal  probability  density  is 
/(*)  =  exp(— xx3). 

Universal  measures  of  location  and  scale  are  MQ  and  DQ.  Diagnostic  measures  of  skewness  are 

Q1(.25),QI(.7S),QIM  =  ,5(Q/(.25)  +  Q/(.75)),  -.25/Q/(.25),.25/Q/(.75); 

note  that  always  Q/(.75)-QJ(.25)  =  .5.  Diagnostic  measures  of  (left  and  right)  tail  behavior  are  QI(.Ol)  and 
QI(.99).  A  combined  measure  of  tail  behavior  (useful  for  probability  density  estimation)  is  Qf  (.99)  — Qf  (.01), 
called  the  identification  quantile  range. 

4.  Identification  Quantile  Box  Plot. 

An  identification  quantile  box  plot  is  a  plot  consisting  of  a  box  from  QI(. 25)  to  QI(.7S)  with  a  midline 
at  QI(. 5)  =  0  and  a  cross  at  QIM.  Fences  are  defined  to  be  max(-l,Q/(0))  and  min(l,Q/(l)).  Lines 
are  drawn  from  identification  quartiles  to  fences.  Data  values  outside  the  fences  are  considered  outliers  or 
out-and-  outliers,  depending  on  whether  they  are  interpreted  as  representing  long  tails  or  blunders.  One 
also  indicates  the  location  of  (sample  mtxn-MQ)/ DQ.  The  values  of  identification  quartiles  and  fences  are 
recorded  on  the  plot. 

5.  Tall  Classification  of  Probability  Laws. 

Representations  of  the  density  quantile  function  behavior  as  u  tends  to  0  or  1  is  used  to  provide  a 
quantitative  index  of  tail  behavior  which  we  call  the  tail  exponent.  It  is  used  to  qualitatively  classify  tail 
behavior  in  three  types,  called  short,  medium,  and  long.  Medium  tails  are  further  classified  in  three  groups: 
medium-short,  medium- medium,  medium-long;  a  good  summary  of  these  concepts  introduced  by  Parsen 
(1979)  is  given  by  Schuster  (1984). 

These  five  groups  reduce  to  three  groups  (short,  medium,  long)  when  expressed  in  terms  of  haxard  rate 
functions  (decreasing,  constant,  increasing).  The  right  and  left  hasard  functions  are  respectively  defined  by 

M*)  =  /(*)/(  1  -  F(x)),ho(z)  =  f(x)/F(x). 


The  right  and  left  hazard  quantile  functions  are  defined 

<?(«)  =  /<?(«*)/(!  -  u),hoQ(«)  =  /<?(«)/«• 

Our  classifications  of  tail  behavior  can  be  empirically  related  to  the  behavior  of  the  identification  quantile 
function  as  u  tends  to  0  or  1.  The  left  tail  is  classified:  0  >  Q/(.Ol)  >  —.5,  short  tail;  —.5  >  Q/(u)  >  —1, 
medium-short;  -1  >  QI(u),  medium-long  and  long  tail.  The  right  tail  is  classified  short,  medium  short,  or 
long  according  as  QI(. 99)  <  .5,  .5  <  Q/(.99)  <  1, 1  <  QI(. 99). 

For  Graunt’s  Life  Table,  QJ(.25)  =  -.17,Q/(.75)  =  .33 ,QIM  =  .5Q/(.75)  +  Q/(.25)  =  ,08,Q/(.01)  = 
-.27,Q/(.99)  =  1.47.  Experience  with  typical  values  of  these  diagnostic  measures  for  various  standard 
frequently  encountered  distributions  leads  one  to  conjecture  that  the  sample  distribution  function  F~[x)  of 
the  data  in  Table  1  is  fit  by  an  exponential  distribution  f*(i)  with  a  suitable  estimated  mean  /*'. 

0.  Goodness  of  Fit  Plots. 

To  evaluate  the  fit  of  a  model  described  by  F"(x)  or  Q'( u)  to  data  described  by  F“(x)  or  Q'(u)  one 
has  a  bewildering  number  of  options.  The  theory  of  goodness  of  fit  tests  is  concerned  with  the  theoretical 
study  of  the  many  test  statistics  available,  and  offers  little  practical  guidance  on  which  methods  to  use  in 
practice.  This  extensive  literature  can  only  be  briefly  illustrated  in  this  paper,  with  emphasis  on  graphical 
comparisons. 

One  can  compare  plots:  (1)  F“(x)  and  F~(x)  vs.  x,  on  the  same  graph;  (2)  Q'(u)  vs.  Q“(u),  called  Q-Q 
plot;  (3)  £T( tx)  =  F*(Q'(u))  vs.  u,  called  D-uniform  plot  (it  is  equivalent  to  a  plot  of  F“(x)  vs.  F~[x)  called  a 
P-P  plot).  We  recommend  variants  of  the  last  method.  One  can  interpret  D"(u)  as  sample  quantile  function 
of  the  transformed  random  variable  U‘  —  F*(X).  The  goodness  of  fit  problem  is  transformed  to  tests  of  fit 
of  U“  by  a  uniform  [0,1]  distribution  and  by  estimation  of  the  true  quantile  function,  denoted  D(u),  of  U“. 
We  call  £T(u),  0  <  u  <  1,  a  sample  comparison  quantile  function. 

When  is  exponential,  D~(u)  —  1  —  exp(-Q"(u)/p*).  Its  values  for  Graunt’s  life  data  is  given  in  Table 
3.  Figure  2  presents  a  IQQ  plot  as  a  teat  of  fit  of  Graunl’s  life  table  by  an  exponential  distribution.  Figures 
3-6  present  plots  on  same  graph  of  sample  and  smooth  distributions.  The  combinations  are  F~(x)  and  F*(x) 
vs.  x  (Figure  3),  <?”(«)  and  Q*(u)  vs.  u  (Figure  4),  Q’(u)  vs.  Q*(u),  a  Q-Q  plot  (Figure  5),  and  F~(x)  vs 
F~{x),  a  P-P  plot  (Figure  6)  which  also  plots  D‘( u)  =  F~(Q~{u)).  Figures  6  and  7  present  D(u)  plots  as  tests 
of  fit  of  Gaunt’s  life  table  by  an  exponential  distribution;  D~(u)  —  cumulative  weighted  spacings  in  Figure  7. 

7.  IQQ  (Identification  quantile  -  quantile)  Plot. 

To  test  whether  a  sample  is  normal  or  exponential,  one  tests  the  hypothesis  Q(u)  =  jj  +  oQ 0(u)  by  a 
scatter  plot  of  (Qo(«(i))i <?"(«(,?)))  at  suitable  values  u (j),j  =  in  the  interval  0  <  u  <  1.  This  plot, 

called  a  Q-Q  plot,  is  judged  visually  for  linearity. 

We  prefer  to  use  what  we  call  a  IQQ  plot;  it  is  a  scatter  diagram  of  (Qo/(u(y)),Q'/(u(y)))  with  a  grid 
of  lines  which  may  make  it  easier  to  judge  visually  for  linearity.  A  IQQ  plot  for  Graunt’s  life  table  is  given 
in  Figure  2. 

8.  Cumulative  Weighted  Spacings  Function  X?(u). 

Users  of  QQ  and  IQQ  plots  report  that  they  are  difficult  to  interpret.  I  propose  that  one  should  prefer 
plots  that  are  graphs  of  functions  such  as  various  functions  D(u),  0  <  u  <  1,  which  can  be  defined  to  measure 
the  ‘distance’  between  two  distributions. 

To  compare  Q( u)  with  /j  +  <rQo(u)  we  recommend  comparing  their  derivatives  (equal  to  ?(u)  and  a^ofu) 
respectively).  Since  a  is  unknown  we  test  for  constancy  the  ratio  q(u)/q0(v)  —  q{u)  foQo(u),  equivalently 
test  the  deviation  from  1  of 

d(u)  =  *(u)/oQo(u)/<r0, 


$ 


*o  =  f  <l{t)foQo[t)dt. 

Jo 


We  call  d(u)  a  weighted  spacings  function,  since  spacing*  X(k;n)  —  X(k  —  l;n)  are  the  building  blocks  of 
estimators  of  ?(u). 

One  approach  to  testing  d[u)  is  to  estimate  and  test  the  deviation  (from  the  uniform  function  D0{n)  —  u) 
of  the  cumulative  weighted  spacings  function 


D{u)  =  [U  d{t)dt 

Jo 


The  sample  analogue  of  d(u)  and  D(v)  to  test  exponentiality  is:  for  u(j  —  1)  <  u  <  u[j),dT( u)  =  eT(j), 
<r(j)  =  -  Q'M  -  !)))(!  -  .5(u(j  -  1)  +  u(j)))/n'\ 

Z)'(ti)  linearly  interpolates  its  values  ZT(u(y))  =  (T(l)  +  . . .  +  d~(j).  Note  that  a0~  =  y.~. 


Table  3.  GRAUNT’S  LIFE  TABLE  Q',  Q~ ,  F~,  F'[Q~)=  D~  FOR  FITTED  EXPONENTIAL  F~(x) 
1  —  exp(—x/n~),fl"  —  18.2,  D~( u)  CUMULATIVE  EXPONENTIAL  WEIGHT  SPACINGS  (CUMWTSPAC). 


j 

Q'Hi)) 

QXj)) 

*-<?>(/)) 

Zr(u(j))CUMWTSPAC 

0 

.09 

0 

.00 

.00 

.00 

1 

8.13 

6 

.36 

.28 

.27 

2 

16.69 

16 

.60 

.58 

.56 

3 

25.26 

26 

.75 

.76 

.73 

4 

33.39 

36 

.84 

.86 

.85 

5 

41.95 

46 

.90 

.91 

.92 

6 

51.26 

56 

.94 

.95 

.96 

7 

63.89 

66 

.97 

.97 

.986 

8 

83.91 

76 

.99 

.98 

.997 

9 

96.54 

86 

1.00 

.99 

1.00 

Figures  6  and  7  show  how  we  plot  D~(u)  for  comparison  with  D0(u)  =  u.  In  addition  to  the  graphical 
diagnostic  of  the  plot,  there  are  many  numerical  diagnostics  that  can  1m  performed. 

9.  Quantile  Simulation  and  Distribution  of  Extreme  Values. 

A  general  distribution  function  F(z),— oo  <  z  <  oo,  is  a  non-  decreasing  function  continuous  from  the 
right.  Its  quantile  function  (or  inverse  distribution  function),  defined  by 

Q(u)  =  inf{z  :  F{x)  >  u}, 

is  a  non-decreasing  function  continuous  from  the  left.  It  is  an  inverse  under  inequality;  for  any  z  and  u 

F{x )  >  u  if  and  only  if  z  >  <?(u). 

An  important  property  of  quantile  functions  is  a  formula  for  functions  of  random  variables.  THEOREM. 
Assume  g  is  non-decreasing  and  continuous  from  the  left.  Then  Y  =  g(X)  has  quantile  function 


One  can  represent  X  in  terms  of  a  uniform  (0,1]  random  variable  U  by  X  =  Q(U)  since  Q{U)  has 
quantile  function  Q(Qu(u))  =  Q(u). 

When  F  is  continuous,  one  can  transform  X  to  U,  a  uniform  [0,1]  random  variable,  by  U  =  F(X)  since 
F(X)  has  quantile  function  F(Q(u))  =  u. 

A  random  sample  X(l), . . . ,  Jf(n)  of  X  can  be  simulated  by  generating  a  random  sample  tf(l),  ■  ■ .  ,  t/(n) 
of  U,  and  forming  X(j)  =  Q(U(j)).  This  process,  illustrated  in  Figure  8  for  the  normal  and  Cauchy  distribu¬ 
tions,  demonstrates  that  the  quantile  function  provides  a  powerful  graphical  representation  of  a  distribution 
because  of  the  following  equivalence:  (l)  a  random  sample  of  X,  (2)  observing  Q(u),  quantile  function  of 
X,  at  a  random  sample  of  points  on  the  unit  interval.  To  compare  two  distributions,  such  as  the  normal  or 
Cauchy,  one  way  is  to  plot  (as  in  Figure  8)  graphs  of  their  identification  quantile  functions  plotted  on  the 
same  scale  (the  longer  tailed  one  will  have  to  be  truncated  at  a  suitable  value). 

The  representation  of  X  in  terms  of  U  by  X  =  Q[U)  provides  a  quantile  approach  to  the  distribution 
theory  of  order  statistics  and  extreme  values.  Let  X(l;n)  <  ...  <  X(n;n)  be  the  order  statistics  of  a 
random  sample  X(l), . . .  ,X(n).  The  kth  order  statistic  X(k;n)  has  the  same  distribution  as  Q(U(k;  n)) 
where  U(k\  n)  is  the  kth  order  statistic  of  a  random  sample  from  uniform  [0,1], 

10.  Comparison  Quantile  Function. 

A  quantile  based  concept  that  unifies  parameter  estimation  and  goodness  of  fit  hypothesis  testing 
procedures  is  the  comparison  quantile  function  D(u)  =  F(G~i(u))  which  compares  two  distribution  functions 
F(z)  and  G(x).  The  comparison  quantile  density  is 

d(«)  =  £/(«)  =  /(Cr1(u))/i,(C7-1(u)) 

The  Kullback  information  divergence  can  be  evaluated  by 

I(G\F)  =  -f  (log (f(x)/g(x))g(z)dx  =  [  -log d(u)du 
J  -  oo  Jo 

« 

The  graph  of  d(u)  provides  insight  into  the  rejection  method  of  simulation.  One  seeks  to  generate  a 
sample  X(l), ....  X(m)  from  F  as  an  acceptable  subset  of  a  sample  F(l), ....  F(n)  from  G(x).  THEOREM. 
Assume  that  £>(0)  =  0  and  there  is  a  constant  c  such  that  d(u)  <  c  for  all  u.  Generate  two  independent 
uniform  [0,1]  random  variables  £/(l)  and  U( 2).  Acceptance  and  rejection  rule:  If 

U{2)  <  d{U( l))/c, 

then  accept  Y  =  G"’(l/(l))  as  an  observed  value  of  X.  Otherwise  reject  Y.  (Continue  by  generating  two 
more  uniform  [0,1]  random  variables).  The  probability  of  acceptance  is  1/c. 

The  relation  between  two  distributions  F  and  G  is  best  understood  by  a  plot  of  U2  =  d(uj). 

This  plot  can  be  used  to  graphically  describe  the  rejection  rule  of  simulation  and  to  prove  it.  Verify  that 
the  area  under  the  curve  from  Uj  =  0  to  uj  =  G(z)  equals  D[G[z))  =  F(x);  the  event  that  t/(l)  <  C(x) 
and  (/( 2)  <  d(U( l))/c  has  probability  F(x)/e\  the  event  that  X  <  x  can  be  shown  to  have  probability  F(z). 

11.  Nonparametric  Estimation  of  Probability  Density. 

To  identify  distributions  that  fit  data,  one  can  use  parametric  models  such  as  the  location-scale  parame¬ 
ter  model  Q(u)  =  h  +  oQq(u),  or  one  can  nonparametrically  form  estimators  /'(x)  of  the  probability  density 
function  (see  Silverman  (1986)).  We  consider  only  the  kernel  estimator 


roo  =  (i/n)£(i/w(*-*o))M) 


where  K(x)  is  a  probability  density  function  and  h  is  a  bandwidth  to  be  selected. 


For  K  we  recommend  (Parsen  (1962))  the  ‘Parsen  window*  which  is  the  probability  density  of  the  sum 

of  four  uniforms  „ 

(4/3)  -  8x*  +  8x3,  0  <  x  <  .5 

(8/3) ( 1  —  x)3,  .5  <  x  <  1 

0,  1  <  x 

K{-x),  x  <  0 

As  a  first  choice  to  consider  for  h,  by  adapting  Silverman  (1986),  p.  47,  we  recommend 


K(x)  = 


hopt  =  K(fi)DQn 


-.a 


To  accept  or  reject  the  goodness  of  the  value  of  h  chosen  we  judge  the  deviation  from  uniformity  of  the 
comparison  quantile  function  D'(u)  =  F*(Q'(u)).  We  evaluate  this  function  at  u  =  (j  —  .5)/n  by  F'{X (j ;  n) ) . 
Other  choices  of  hopt  are  multiples  of  hopt  based  on  diagnostics  of  the  tail  behavior  of  the  distribution,  given 
by  <JI(. 99)  —  QI(. 01).  The  deviation  of  JX[y)  from  uniformity  is  used  to  guide  the  search  for  the  best  value 
of  h  for  the  data  being  analysed. 

The  details  of  this  procedure  for  choosing  a  kernel  probability  density  estimator  cannot  be  given  in  this 
paper.  It  is  best  explained  by  examples  of  the  quality  of  nonparametrie  probability  density  estimators  to 
which  it  leads  for  famous  data  sets  (Buffalo  snowfall,  Yellowstone  geyser  eruption  times)  which  are  used  as 
test  cases  for  density  estimation  methods  (compare  Silverman  (1986)). 

12.  Conclusion. 

The  process  of  analysing  a  univariate  sample  can  be  viewed  as  fitting  a  smooth  distribution  F~(z)  to 
a  sample  distribution  F~(x).  The  process  of  comparing  F~  and  F~  requires  a  knowledge  of  the  theory  and 
practice  of  quantile  functions.  ‘In  order  to  get  to  the  fruit  of  the  tree  you  have  to  go  out  on  a  limb  ’  is  a 
proverb  that  statisticians  may  take  as  an  omen  that  they  should  explore  the  quantile  limb  which  is  always 
lurking. 
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Titles  of  Figures 


Figure  1.  Identification  quantile  plot  of  Graunt  life  table. 

Figure  2.  Identification  quantile-quantile  (IQQ)  plot  of  Graunt  life  table  vs.  exponential  distribution. 


Figure  3.  F~{x),  Graunt  life  table  sample  distribution  (dot)  and  F'‘(x),  exponential  mean  18.22  distribution 
(solid). 


Figure  4.  Q"(u),  Graunt  life  table  sample  quantile  (dot),  and  Q'(u),  exponential  mean  18.22  quantile 
(solid). 

Figure  5.  Q-Q  plot  of  Q~(u)  vs  Q*(ti). 

Figure  0.  P-P  plot  of  F~[x)  vs  F~(x),  same  as  plot  of  D"(u)  =  /’‘(Q'fu)),  Do(u)  —  u  is  also  plotted  (dots). 
Figure  7.  IT(u),  cumulative  exponential  weight  spacings  (solid);  D0{u)  =  u  (dot). 

Figure  8.  Random  sample  from  normal  (top)  and  Cauchy  (bottom)  represented  as  values  of  quantile 
function  Q(u)  at  random  sample  from  uniform  [0,1]. 


Figure  9.  Comparison  quantile  density  d(u)  =  £>'(u),  D(u)  =  Gf_1(u),  F  normal,  G  Cauchy,  d(u)  bounded 
(top),  F  Cauchy,  G  normal  d(u)  unbounded  (below). 
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